[One Workflow](scale): Lazy-load workflow step I/O by rosomri · Pull Request #253547 · elastic/kibana

rosomri · 2026-02-17T18:59:59Z

Summary

Reduces memory pressure and network payload size by lazy-loading workflow step execution I/O data instead of fetching it all upfront.

Screen.Recording.2026-02-18.at.13.18.51.mov

Lazy-load step I/O: Execution polling (loadExecutionThunk) now requests lightweight data (includeInput=false, includeOutput=false). Full step input/output is fetched on demand — when the user clicks a step's tab or hovers a template expression in the YAML editor.
Server-side source filtering: getWorkflowExecution accepts includeInput/includeOutput query params and applies _source_excludes on Elasticsearch mget/search calls, avoiding large payloads that can cause OOM.
Bidirectional React Query cache: Step I/O fetched by the execution detail panel (useStepExecution) or by the YAML editor hover provider share a single cache via queryClient.setQueryData, preventing duplicate HTTP requests regardless of access order.
Cache cleanup on execution switch: Cached step data is cleared (removeQueries) when navigating to a different execution, preventing memory buildup.
Template hover priority: Reordered provideCustomHover so template expression hovers ({{ }}) take precedence over validation decoration tooltips.
Pure hover enrichment: Refactored ensureStepData → fetchStepDataIfNeeded to return enriched data instead of mutating the shared executionContext ref. Removed redundant fetchedStepIds tracking that caused a caching bug on repeated hovers.
Extracted useLazyStepExecutionFetcher hook: Moved inline fetch logic out of the YAML editor component into a dedicated hook for readability and testability.
Narrowed memo deps: tabs memo in WorkflowStepExecutionDetails now depends on hasInput/hasError booleans instead of the full stepExecution object.

Example flows

1. Execution polling — lightweight, no I/O

GET /api/workflowExecutions/exec-123?includeInput=false&includeOutput=false

Returns execution metadata and step statuses/durations, but input and output fields are excluded at the Elasticsearch _source level. This runs every poll cycle.

2. Hovering a template expression — lazy fetch + cache

User hovers {{ steps.search.output.hits }} in the YAML editor:

1. Hover provider calls fetchStepExecutionData("search")
2. Hook maps "search" → step doc ID "step-doc-abc"
3. React Query cache miss → GET /api/workflowExecutions/exec-123/steps/step-doc-abc
4. Response stored in cache: queryClient.setQueryData(["stepExecution", "exec-123", "step-doc-abc"], data)
5. Hover tooltip shows the resolved value

Second hover on the same step (or any steps.search.* expression):
1. fetchStepExecutionData("search") → cache hit → no HTTP request
2. Hover tooltip shows the resolved value immediately

For terminal steps, useStepExecution uses staleTime: Infinity — the cached data never goes stale for the lifetime of that execution.

3. Opening the I/O tab — served from cache

After the hover above already fetched step-doc-abc, user clicks the step and opens the Output tab:

1. useStepExecution("exec-123", "step-doc-abc", "completed") runs
2. React Query finds ["stepExecution", "exec-123", "step-doc-abc"] in cache
3. No HTTP request — data renders immediately

This works in both directions: if the user clicks the Output tab first, the hover provider finds the data in cache on subsequent hovers.

4. Switching execution — cache cleanup

1. User selects execution "exec-456"
2. useEffect cleanup fires: queryClient.removeQueries({ queryKey: ["stepExecution", "exec-123"] })
3. All cached step I/O for the previous execution is evicted
4. Fresh lightweight polling starts for "exec-456"

Test plan

get_workflow_execution.test.ts — Verifies _source_excludes is correctly passed to esClient.mget and searchStepExecutions based on includeInput/includeOutput flags
get_workflow_execution_by_id.test.ts — Updated existing route tests; added cases verifying query params are parsed and forwarded to the API layer
use_step_execution.test.ts — Verifies staleTime: Infinity and no polling for terminal steps; polling at 5s for running steps; polling stops on status transition
workflow_execution_detail.test.tsx — Verifies removeQueries is called on unmount and when executionId changes
unified_hover_provider.test.ts — Verifies hover values persist across multiple invocations, enrichment skipped when output already present, graceful fallback when fetch returns null
workflow_yaml_editor.test.tsx — Updated test wrapper to include QueryClientProvider for useLazyStepExecutionFetcher

…step I/O data when switching to a different execution

…xecution_api

… into break_execution_api

…xecution_api

semd · 2026-02-18T18:21:28Z

+  includeInput = true,
+  includeOutput = true,


When includeInput and includeOutput are omitted (both are optional), the API includes both by default.

From an API semantics perspective, that feels counterintuitive. Optional flags that default to “included” can be surprising, especially if they affect payload size or sensitive data exposure.

Would it make more sense to default both to false, and only include input/output when explicitly requested? That would make the API more explicit and predictable.

Agreed - I initially set them to true by default for backward compatibility, but you’re right. I’ll switch them to false and share an update in the channel.

…xecution_api

semd

This is a great job @rosomri 🎸
LGTM!

elasticmachine · 2026-02-19T10:29:54Z

💔 Build Failed

Buildkite Build
Commit: 5cbe34a

Failed CI Steps

Test Failures

[job] [logs] Jest Tests #3 / WorkflowsService getWorkflowExecution should return workflow execution with steps
[job] [logs] Jest Tests #3 / WorkflowsService getWorkflowExecution should return workflow execution with steps

Metrics [docs]

Module Count

Fewer modules leads to a faster build time

id	before	after	diff
`workflowsManagement`	1285	1287	+2

Async chunks

Total size of all lazy-loaded chunks that will be downloaded as the user navigates the app

id	before	after	diff
`workflowsManagement`	1.5MB	1.5MB	+2.2KB

History

…xecution_api

## Summary Reduces memory pressure and network payload size by lazy-loading workflow step execution I/O data instead of fetching it all upfront. https://github.com/user-attachments/assets/2d77d88d-4017-44bd-8581-082717352921 - **Lazy-load step I/O**: Execution polling (`loadExecutionThunk`) now requests lightweight data (`includeInput=false`, `includeOutput=false`). Full step input/output is fetched on demand — when the user clicks a step's tab or hovers a template expression in the YAML editor. - **Server-side source filtering**: `getWorkflowExecution` accepts `includeInput`/`includeOutput` query params and applies `_source_excludes` on Elasticsearch `mget`/`search` calls, avoiding large payloads that can cause OOM. - **Bidirectional React Query cache**: Step I/O fetched by the execution detail panel (`useStepExecution`) or by the YAML editor hover provider share a single cache via `queryClient.setQueryData`, preventing duplicate HTTP requests regardless of access order. - **Cache cleanup on execution switch**: Cached step data is cleared (`removeQueries`) when navigating to a different execution, preventing memory buildup. - **Template hover priority**: Reordered `provideCustomHover` so template expression hovers (`{{ }}`) take precedence over validation decoration tooltips. - **Pure hover enrichment**: Refactored `ensureStepData` → `fetchStepDataIfNeeded` to return enriched data instead of mutating the shared `executionContext` ref. Removed redundant `fetchedStepIds` tracking that caused a caching bug on repeated hovers. - **Extracted `useLazyStepExecutionFetcher` hook**: Moved inline fetch logic out of the YAML editor component into a dedicated hook for readability and testability. - **Narrowed memo deps**: `tabs` memo in `WorkflowStepExecutionDetails` now depends on `hasInput`/`hasError` booleans instead of the full `stepExecution` object. ### Example flows **1. Execution polling — lightweight, no I/O** ``` GET /api/workflowExecutions/exec-123?includeInput=false&includeOutput=false ``` Returns execution metadata and step statuses/durations, but `input` and `output` fields are excluded at the Elasticsearch `_source` level. This runs every poll cycle. **2. Hovering a template expression — lazy fetch + cache** User hovers `{{ steps.search.output.hits }}` in the YAML editor: ``` 1. Hover provider calls fetchStepExecutionData("search") 2. Hook maps "search" → step doc ID "step-doc-abc" 3. React Query cache miss → GET /api/workflowExecutions/exec-123/steps/step-doc-abc 4. Response stored in cache: queryClient.setQueryData(["stepExecution", "exec-123", "step-doc-abc"], data) 5. Hover tooltip shows the resolved value Second hover on the same step (or any steps.search.* expression): 1. fetchStepExecutionData("search") → cache hit → no HTTP request 2. Hover tooltip shows the resolved value immediately ``` For terminal steps, `useStepExecution` uses `staleTime: Infinity` — the cached data never goes stale for the lifetime of that execution. **3. Opening the I/O tab — served from cache** After the hover above already fetched `step-doc-abc`, user clicks the step and opens the Output tab: ``` 1. useStepExecution("exec-123", "step-doc-abc", "completed") runs 2. React Query finds ["stepExecution", "exec-123", "step-doc-abc"] in cache 3. No HTTP request — data renders immediately ``` This works in both directions: if the user clicks the Output tab first, the hover provider finds the data in cache on subsequent hovers. **4. Switching execution — cache cleanup** ``` 1. User selects execution "exec-456" 2. useEffect cleanup fires: queryClient.removeQueries({ queryKey: ["stepExecution", "exec-123"] }) 3. All cached step I/O for the previous execution is evicted 4. Fresh lightweight polling starts for "exec-456" ``` ## Test plan - [x] `get_workflow_execution.test.ts` — Verifies `_source_excludes` is correctly passed to `esClient.mget` and `searchStepExecutions` based on `includeInput`/`includeOutput` flags - [x] `get_workflow_execution_by_id.test.ts` — Updated existing route tests; added cases verifying query params are parsed and forwarded to the API layer - [x] `use_step_execution.test.ts` — Verifies `staleTime: Infinity` and no polling for terminal steps; polling at 5s for running steps; polling stops on status transition - [x] `workflow_execution_detail.test.tsx` — Verifies `removeQueries` is called on unmount and when `executionId` changes - [x] `unified_hover_provider.test.ts` — Verifies hover values persist across multiple invocations, enrichment skipped when output already present, graceful fallback when fetch returns null - [x] `workflow_yaml_editor.test.tsx` — Updated test wrapper to include `QueryClientProvider` for `useLazyStepExecutionFetcher` --------- Co-authored-by: kibanamachine <42973632+kibanamachine@users.noreply.github.com>

…load I/O change (#254087) ## Summary Fixes workflow execution output retrieval for agent-builder consumers. After [#253547](#253547) changed `getWorkflowExecution` to exclude step I/O by default, `getExecutionState` was no longer receiving step outputs - causing `getWorkflowOutput` to always return `null`. This passes `includeOutput: true` explicitly so the output is available when the execution completes.

rosomri added 6 commits February 17, 2026 16:02

includeInput/includeOutput query params

77f117e

lazy-load step execution I/O and exclude from polling + Clear cached …

9818adf

…step I/O data when switching to a different execution

linting

4005027

working with debug logs

c1981ee

hover to lazy load step IO

39db75b

Merge branch 'main' of https://github.com/elastic/kibana into break_e…

617b77c

…xecution_api

rosomri requested a review from a team as a code owner February 17, 2026 18:59

rosomri added release_note:skip Skip the PR/issue when compiling release notes backport:skip This PR does not require backporting Team:One Workflow Team label for One Workflow (Workflow automation) labels Feb 17, 2026

rosomri marked this pull request as draft February 17, 2026 19:00

kibanamachine and others added 6 commits February 17, 2026 19:19

Changes from node scripts/eslint_all_files --no-cache --fix

baf1920

resolve self CR comments

901acce

add tests for fetcher

c6c4f9d

CR second iter

9803c44

remove commented console logs

bf8f496

Merge branch 'main' of https://github.com/elastic/kibana into break_e…

184be5b

…xecution_api

rosomri marked this pull request as ready for review February 18, 2026 11:28

kibanamachine and others added 7 commits February 18, 2026 11:49

Changes from node scripts/eslint_all_files --no-cache --fix

cfbad8d

Only trust the cache for terminal steps

a469d37

Merge branch 'break_execution_api' of https://github.com/rosomri/kibana…

c4a871d

… into break_execution_api

type

37bedda

Merge branch 'main' of https://github.com/elastic/kibana into break_e…

6ac28b7

…xecution_api

Merge branch 'main' into break_execution_api

8f5565a

Merge branch 'main' into break_execution_api

419a7e7

semd reviewed Feb 18, 2026

View reviewed changes

rosomri and others added 3 commits February 19, 2026 10:57

Merge branch 'main' of https://github.com/elastic/kibana into break_e…

bc0bf70

…xecution_api

indlude IO to false by default

e0adfaa

Merge branch 'main' into break_execution_api

5cbe34a

rosomri requested a review from semd February 19, 2026 09:18

rosomri enabled auto-merge (squash) February 19, 2026 10:12

semd approved these changes Feb 19, 2026

View reviewed changes

rosomri added 2 commits February 19, 2026 13:55

Merge branch 'main' of https://github.com/elastic/kibana into break_e…

c2d7472

…xecution_api

'should return workflow execution with steps, excluding I/O by default

ddf4b4a

rosomri merged commit 1094e4e into elastic:main Feb 19, 2026
16 checks passed

kibanamachine added the v9.4.0 label Feb 19, 2026

rosomri mentioned this pull request Feb 19, 2026

(fix): agent-builder to include workflow execution output after lazy-load I/O change #254087

Merged

shahargl mentioned this pull request Mar 24, 2026

[One Workflow] Add entries Liquid filter for iterating over object keys #259249

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[One Workflow](scale): Lazy-load workflow step I/O #253547

[One Workflow](scale): Lazy-load workflow step I/O #253547
rosomri merged 24 commits intoelastic:mainfrom
rosomri:break_execution_api

rosomri commented Feb 17, 2026 •

edited

Loading

Uh oh!

semd Feb 18, 2026

Uh oh!

rosomri Feb 19, 2026

Uh oh!

rosomri Feb 19, 2026

Uh oh!

semd left a comment

Uh oh!

elasticmachine commented Feb 19, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

rosomri commented Feb 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Example flows

Test plan

Uh oh!

semd Feb 18, 2026

Choose a reason for hiding this comment

Uh oh!

rosomri Feb 19, 2026

Choose a reason for hiding this comment

Uh oh!

rosomri Feb 19, 2026

Choose a reason for hiding this comment

Uh oh!

semd left a comment

Choose a reason for hiding this comment

Uh oh!

elasticmachine commented Feb 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

💔 Build Failed

Failed CI Steps

Test Failures

Metrics [docs]

Module Count

Async chunks

History

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

rosomri commented Feb 17, 2026 •

edited

Loading

elasticmachine commented Feb 19, 2026 •

edited

Loading